IL64usl 

AN NROM FABRICATION METHOD 

Boaz EITAN 

FIELD OF THE INVENHON 

The present inveuticKa relates generally to methods of fehricadoa of nitride read 
only memory (NROM) cells and arrays. 

BACKGROUND OF THE INVENTION 

Fig. 1, to 'widch reference is made, illustrates a typical prior art NROM cell. 
This cell incliides a substrate 10 in which are implanted a so-urce 12 and a drain 14 and on top of 
which lies an oxide-nitride-oxide (ONO) structure 16 having a layer of nitride 17 sandwiched 
between two oxide layers 18 and 20, On top of the ONO structure 16 lies a gate conductor 22. 
Between source 12 and drain 14 is a channel 15 formed under ONO structure 1 6. 

Nitride section 17 provides the charge retention mechanism for programming the 
memory cell. Specifically, when programming voltages are provided to source 12, drain 14 and 
gate conductor 22, electrons flow towards drain 14, According to the hot electron injection 
phenomenon, some hot electrons penetrate through the lower section of silicon oxide 18, 
especially if section 1 8 is thin, and are then collected in nitride section 17. As is known in the 
art;, nitride section 17 retains the received charge, labeled 24, in a concentrated area adjacent 
drain 14. Concentrated charge 24 significaatly raises the threshold of the portion of the channel 
of the memory cell under charge 24 to be higher than the threshold of the temaimag portion of 
the channel 15. 
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when conceatrated change 24 is preseqit (i,e, the cell is programmed), the raised 
threshold of ihe cell does not peimit the cell to be placed into a conductive state during reading 
of the cell- If concenlrated charge 24 is not present, flie read voltage on gate conductor 22 can 
overcome liie much lower threshold aod accoidingly, channel 15 becomes inverted and hence, 
condiictive* 

US Application Serial No. 08/861,430 filed July 23, 1996 and owned by the 
common inventor of the pres^ invention describes an icaproved NROM cell, which is 
programmed in one direction and read in the reverse direction. 

It is noted that the threshold voltage Vlh of NROM cells is generally very 
sensitive to the voltages Vdrain and Vgate provided on the drain 14 and on the gate 22, 
respectively. Furfhcmiore, U.S. Application Serial No. 08/861,430 selects the voltages Vdrain 
and Vgate are selected in order to ensure that tbe charge trapped in a portion of Hie nitride layer 
1 7 remams localized in ^lat portion. 

Kjead only memoiy cells, including a nitride layer in tiie gate dielectric (NROM) 
are described, inter alia, in US patents 5,168,334 to Mitchell et al. and 4,173,766 to Hayes. 

Mitchell et al. describe two processes to produce the NROM cells. In the first 
process^ bit lines are first created in the substrate, ajfler winch the surfece is oxidized. Following 
the oxidadon, the ONO layers are added over the entue airay, Polysilicon word lines arc then 
deposited in rows over the ONO layers. Unfortunately, when an oxide layer is grown (typically 
under high temperature), the already present bit lines ^wiU dijSEuse to the side, an undesirable 
occurraice ^?rflich limits the extent to which the cell si2e can be shrunk. 

la the second process, the ONO layers are formed over the entire array first, on 

top of which conductive blocks of polysilicon are formed. The bit lines arc implanted between 

the blocks of polysilicon after which the ONO layers are etched away from on top of the bit 
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lines. Planarized oxide is then deposited between the polysilicon blocks after which polysilicon 
word lines are deposited. Mitohell et al, utilize a planarized oxide since such can be deposited 
rather than grovm, Mitchell et al, cannot grow an oxide over the bit lines since such an 
oxidation operation would also grow oxide over the polysilicon blocks and the latter must be 
left with a very clean surface in order to connect with the polysilicon word lines. Unfortunately, 
planarized oxide is not a clean oxide nor docs it seal around tbe edges of the ONO sections 
Fmtlaeimore, the planarized oxide adds coroplexity and cost to the process. 

Hayes et al. describe an NROM cell having only an oxide-nitride (ON) layer- 
The cells in the array are created by forming layers of oxide, nitride and polysilicon (the latter to 
produce the gate) one after anolber and then pattenmig and etching these layers to form the on 
cells. The uncapped nitride in each cell does not hold charge well in both the vertical and lateral 
directions. Due to hole and hot electron conduction within the nitride, tiie charge to be stored 
will flow vertically towards the gate coveting it unless the nitride is ibick and will flow laterally 
in the nitride in response to lateral electric fields. 



SUMMARY OF THE PRESENT INVENTION 

It is an object of the present inventiojtx to provide a method of fabricating NROM 
cells and NROM cell arrays with impioved data retention. 

Thexe is therefore provided, in accotdance with a preferred embodiment of the 
5 present invention, a method of fabricating an oxide-nitride-oxide (ONO) layer in a memory cell 
to retain charge in the nitride layer. The method includes the steps of forming a bottom oxide 
layer on a substrate, depositing a nitride layer and oxidizing a top oxide layer, thereby caxising 
oxygen to be introduced into the nitride layer. 

Alternatively, in accordance witii a preferred embodiment of the present 
P 10 invention, the tn^thod inchides the steps of forming a bottom oxide l5^r on a substrate, 
S depositing a nitride layer, oxidizing a portion of atop oxide layer thereby causing oxygen to be 

introduced into Ihe nitride layer and depositing a remaining portion of the top oxide layer, 
a S thereby assisting in controlling the amount of oxygen introduced into the nitride layer. 

^ Further, in accordance with a preferred CTabodiment of the present invention, the 

Z 15 m^od inckides the steps of formins a bottom oxide I^yer on a substrate, depositing a nitride 
P layer, depositing a portion of a top oxide lay^ and oxidizing a lemamh^ portion of the top 

P oxidelayer^thereby causixig oxygen to be introduced into the intride layer. 

There is provided, in accordance with a prefiaired embodiment of the 
present invention, a method for unproving the charge retention in a nitride layer of a 
20 memory chip. Hie method includes 1h& steps of depositing a nitride layer and 
introducing oxygen into the nitride l^er, 

AltOTiatively, in accordance with a preferred embodiment of the present 
invention, the method includes the steps of d^ositing a nitride layer, controlling the 

thickness of the deposited nitride layer and introducing oxygen into the nitride layer. 
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Further, m accoxdaiice with a preferred embodiment of the ptesent 
iaveiition, the method inclmies the steps of forming a bottom oxide layer oaat a substrate, 
depositiiig a mtride layer at a ihidmess approximate to ihc final thickness after 
fabrication, depositing a portion of a top oxide layer and oxidizing a temaining 
portion of the top oxide layer, thereby assisting in controlling the introduction of oxygpn 
into the nitride layer. 



BRIEF D£SCSIPTION OF THE DRAWINGS 

Hie present invetitioii will be understood mi appreciated more fiiUy firom the 
foliowixsg detailed descriptLon takea in coi^uziction with the dzawings in which: 

Fig. lis a schematic illustration of a prior art NROM memory cell; 

Fig. 2 is a schematic illustration of the NROM memory chip after an 
oxide-nitride-oxide layer hz& been laid down; 

Fig. 3 A is a schematic iUuslration in top view of a bit line implant mask; 

Fig. 3B a cross section of a portion of the memory array of tibe chip of Fig. 2 
after the mask of Fig. 3 A is laid down and after etching away the exposed portions of the ONO 
layer leaving part of tiie bottom oxide layei^ 

Fig- 3C shows the cross section of Fig- 3B acfter an implant of an impurity to 
fi>rm the bit lines in the memory array portion of the chip of Fig. 3B; 

Fig. 4 shows in cross section the memory array portion of the chip of Fig. 3C 
after oxidation of the bit lines; 

Fig. 5 is a schematic illustration of an ONO protect mask for the memory array 
and periphery sections of the chip; and 

Figs 6A and 6B arc schematic illustcations of the memory array portion of the 
chip of the present iave^ation after a polysilicon or polysilicide layer 60 has been laid down, in 
top and side view, respectively. 
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DETAILED DESCRIPTION OF PREFERRED EMBODBVIENTS 

Reference is now made to Figs. 2, 3A, 3B, 3C, 4, 5, 6A and 6B, which illustrate 
the NROM fabrication method of the present invention. Similar reference numerals herein refer 
to similar elements. It is noted tliat the present invention covers the fabrication of the entire 
chip, which includes the NROM memoiy array portion and the complementary metal oxide 
semiconductor (CMOS) periphery devices. 

In the following discussion, the process of etching a layer, which includes 
placLag photoresist on the layer, placing a mask on the photoresist, etching wherever the mask is 
not and removing the photoresist, will not be detailed. 

The method begins v^ith a standard complementary metal oxide semiconductor 
(CMOS) initial process for preparing the substrate 10 including N well formation and field 
oxide formation, A screen oxide layer is then grown (not shown) on substrate 10 after which it 
is removed, typically with a wet etch thereby to remove any residual nitride at the edge of the 
field. A t>T2ical thickness of the screen oxide layer is 200 - 400A 

Substrate 10 is tlien overlaid with an ONO layer, A bottom oxide layer 30 is 
growa over substrate 10 typicall}^ to a thickness of between 50A and 150A ia a low temperature 
oxidation operation. A typical oxidation temperatme is about SOO'^C but it can var>' between 
750 - lOOO'^C. A preferred thickness of the bottom oxide layer 30 is 80A. 

A nitride layer 32 is then deposited over bottom oxide layer 30 to a thickness of 
between 20 A and 15 OA where a preferred thickness is as thin as possible, such as lOA - 5 OA. 
Applicant notes that a thin layer of the nitride prohibits lateral movement of tire charge retained 
within the nitride, and hence, it is beneficial to control the thickness of nitride layer 32. 

Top oxide 34 is then produced either through oxidation of the nitride (i.e. 

mowing of the oxide), or by deposition or by a combination thereof. It is noted that top oxide 
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34 consuraes nitride during oxidation, where typically half of the oxide thickness comes from 
the consumed nitride. Thus, if it is desired to have a top oxide which is 100 A thick, the nitride 
layer 32 should be at least 50A thicker tlian the fmal desired nitride thickness, with this extra 
nitride being for consumption in the formation of die top oxide layer. 

It is also noted that, during oxidation of top oxide layer, some of the oxygen is 
introduced into the non-consumed nitride layer. 

Ultimately, as is described hereinbelow, nitride layer 32 is transformed into 
nitride section 17, which provides the charge retention mechanism for the memory cell. Nitride, 
particularly due to its structure, traps the electrons, which are introduced into nitride section 17. 
Oxygen however, is a better insulator than nitride and helps to minimize the lateral movement 
of electrons in nitride layer 32. It is thus an important element for effective retention of the 
charge. It is tlierefore noted that one of the factors effecting the quality of retention ability of 
nitride section 17 is the concentration of oxygen witliin nitride layer 32. The oxygen 
concentration is defined as the percentage of oxygen atoms relative to the nitride atoms, 
irrespective of the type of molecule in which the oxygen atoms are found. The concentration 
can range jSrom a low of 10% to a high of 80%. 

Hence, in order to produce a retentioxi layer, which provides effective clwge 
retention, it is recommended to introduce a high percentage of oxygen into the nitride. 
Nonetlieless, if the oxi-nitride composition is too oxygen rich, even though nitride is essentially 
an oxidation barrier, a run-away situation is produced whereby nitride layer 32 absorbs too 
much oxygen and ceases to act as a barrier for oxygen diffusion. In such an instance, the 
oxygen inU'oduced into ihe oxygen rich nitride layer 32 reaches the sihcone oxide layer 18, and 

become SIO2. 

In summary, in order to produce a rjitride section 1 7 with maximum retention 
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qvialities, it is desirable to make mtride layer 32 as thin as possible, with the maximum oxygen 
concentration, without inducing a run-away situation. Consequently, it is critical to control the 
fabrication the ONO structure, and specifically, the mamer m wMch the top oxide 34 is 
produced. 

The top oxide is typically of a tiiiickness of between 50A and 150A- Tliree 
alternative operations for creating atop oxide 34 of lOOA are described hereinbelow. 

The fbrst method involves depositing nitride layer 32 of approximately 
150-160A, growing 120 - 130A of top oxide 34, (which includes consuming 60 - 65 A of nitride 
layer 32) and removing 20 - 30A of oxide layer 34 dxiring cleaning. Since a large portion of 
nitride layer 32 is consumed, it is difiScult to control the amount of oxygen introduced into 
nitride layer 32. Thus, in order to avoid a possibility of run-away conditions in the mtride layer, 
it is essential to "leave" a thicker mtride layer. This alternative produces a thicker nitride layer; 
however it provides for high mtroduction of oxygen into the nitride and is a simple process to 
perfoim. 

The second method involves depositing nitride layer 32 at a thickness of 
approximately 60A, growing a thin layer of oxide layer 34 (approximately 40A-) while 
consuming about 20A of nitride, depositing 80-90A and removing 20-3 OA during cleaning. 
Since depositing oxide is a quicker process than growing oxide, this alternative is quicker than 
die first alternative and it offers m^ginally better control over the amount of oxygen introduced 
into nitride layer 32 

It is noted tliat the longer the oxidation process continues the greater the effect 

on previously produced layers. Therefore, in order diminish the effect on previous layers, it is 

desirable to create the top oxide layer as quickly as possible. 

The third method involves depositing nitride layer 32 at a thickness close to the 
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preferred final thickness, such as 20A, depositing 100 - 11 OA of oxide, growing 2 - 5A of oxide 
and removing 20 - 30A of oxide during cleaning. When growing oxide after it has been 
deposited^ the deposited layer acts a$ a hairier between the growing oxide and nitride layer 32, 
Hence, the oxygen is introdixced slowly into nitride layer 32. This alternative is slower than the 
previous alternatives; however, it provides a thin nitride layer and a more controlled manner for 
regulating the introduction of oxygen into the nitride layer. 

The process by which the nitride and top oxide layers are generated depends on 
tlie ability of the manufacl-uring facility to control the ihiclcness and composition of the layers of 
the ONO struclme. 

At this poiQt, the entire substrate 10 is covered with an ONO layer, as shown in 
Fig. 2. Hie next step involves depositing a bit line mask 40 (typically photoresist 42 patterned 
in a well Icnown marnier), whose layout within the memory array portion of the chip is shown in 
Fig- 3 A, to create the bit lines, forming lines of sources and lines of drains. Fig. 3B illustrates a 
portion of the resultant chip within the memory array portion with tiae photoresist 42 patterned. 
Fig. 3B is a side view (similar to Fig, 2) with the columns 42 of the bit line mask in place. 
Photoresist columns 42 define the areas where the bit lines are not to be implanted (i.e. the 
locations of the channels 15 (Fig. 1)). 

Prior to implanting the bit lines, the top oxide and nitride layers 32 and 34, 
respectively., are etched av(/ay from the areas between columns 42. The etch operation is 
typically a dry etch which might also etch a portion 44 of bottom oxide layer 30 which is 
between coluixuis 42, leaving portion 44 with a predetermined tbiclcness, such as 50A. The etch 
operation produces oxide sections 18 and 20 and nitride section 17 under each column 42. 

After the etch operation, bit lines 12 are implanted (Fig. 3C) in the areas bet-ween 

columns 42. A typical implant might be 2 - 4xlO^Vcm2 of Arsenic at 50Kev. It will be 
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appreciated tliat tliis is a self-aligrxed implant in which the bit Unes are self-aligned to the ONO 
structures. 

The photoresist layer 42 is then removed and bit line oxides 50 (Fig. 4) are then 
thermally grown over the bit lines 12 in an oxidation operation. At the same time, side oxides 
51, typically of 30A, axe grown along the sides of nitride layers 17 to improve data retention 
within tlie nitride layers. The oxidation typically occurs in the range of 800°C to 950°C but 
preferably at the lower side of this range to minimize the diffusion of the bit line impurity while 
maximizing the thickness of the thermal oxide. This lowers the bit line capacitance. The 
oxidation temperature also activates the implanted bit line impxirities. 

Thus the typical oxidation process is a low temperature oxidation of about 
800°C which, on a P- substrate^ normally is continued for a time sufiScient to grow the 
equivalent of lOOA of thermal oxide. On the chip of the present invention, however, top oxide 
sections 20 will not significantly increase in thickness during the bit line oxidation due to the 
close presence of nitride sections 18 wMe oxide layer 44 over the bit lines 12 will increase 
significantly due to the presence of Arsenic in the bit Hnes 12. The result is that the bit line 
oxides 50 are typically very thick, such as 500A thiclc, fliereby lowering the bit line capacitance. 

It will be appreciated that tlie present invention separates the creation of bottom 
oxide sections 18 (and thus, of the entire ONO structure 16) from the creation of bit line oxides 
50- Bottom oxide sections 18 are created over the entire array as part of creatiug the ONO 
structures- Bit Ime oxides 50 are created during the bit line oxidation operation and this 
oxidation does not significantly affect the oxide layers in the ONO structures. Fiirthermore, bit 
line oxides 50 are self-aligned to the ONO structures and, since the oxidation operation is at a 
relatively low taraperatuie, bit lines 12 do not significantly difflise into substrate 10 duriag the 
oxidation operation* 
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It will jEurfher be appreciated that the ONO layers have been laid down on the 
entire chip and thus, are present in tlae periphery, hi accordaDce with a preferred embodiment of 
the present invention, the ONO layers can be utilized as thick gate oxides in the portions of the 
periphery where tliicker oxides are needed. Thus, if two gate dielectric thicknesses are required 
in the periphery, the present invention provides one gate dielectric using the ONO layers and the 
second, thirmer gate dielectric can be produced in a separate gate oxide production step. 
Furthermore, as shown in Fig. 5, a single mask 52 can be utilized to mark both the locations 54 
of the thick gate oxides as well as to protect the memory array (area 56) while etchitig and 

oxidizing the periphery. 

Mask 52 can be utilized in one of two alternative ways, hi the first embodiment, 
a threshold level adjustment implant for tiie peripheral transistors is performed after maslc 52 is 
laid down and patterned. This provides the periphery with a threshold level different from that 
of the memory array area 56. In the second embodiment, the threshold level adjustment implant 
is performed on the entire chip prior to laying down mask 52. hi this embodiment, mask 52 
serves only to mark the locations where the ONO layers are to be removed. 

Specifically, in the first embodiment, after mask 52 is laid down, the tlireshold 

voltage level adjustment is performed. This procedure involves implaathig boron through the 

ONO layers into the portions of the periphery of the chip not covered by mask 52. Typically, 

there ai-e tv.ro adjustment steps, one each for ihe n-channel and p-channel transistors. It will be 

appreciated that, in accordance with a preferred embodiment of the present invention, the 

adjustment implant is performed through the ONO layers smce they are not yet capped and tlius, 

do not block the implant operation. It will further be appreciated that, for the threshold 

adjustment procedure, the to-be-removed ONO layers act as a sacrificial oxide (e.g. an oxide 

grown for an implant operation and immediately thereafter removed). 
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Following the threshold voltage adjustment procedure, the ONO layers on the 
unmasked portions of the chip are removed. Initially, a dry oxide etch is utilized to remove top 
oxide 34 and nitride 32 layers after which a wet etch is utilized to remove bottom oxide layer 
30. Following removal of mask 52, a gate oxide (not shown) of typically 100 - 150A is 
thermally grown over the entire chip. Due to the presence of nitride in the memory array, the 
gate oxide step does not significantly affect the thiclcness of top oxide 20. However, this step 
creates gate oxides for the transistors in the periphery. 

It will be appreciated that the gate oxide thickness is thus independent of the 
thicknesses of the bit line oxide 50 and top oxide 20. 

In a second embodiment, mask 52 is laid down after the gate and threshold 
voltage level adjustment procedure is performed. Thus, the memory array portion of the chip 
also receives threshold level adjustments. Witlr mask 52 in place, the ONO layers on the 
unmasked portions of the chip are removed, as described hereinabove. Once again, the ONO 
layers act as a sacrificial oxide, eliminating the necessity for the additional sacrificial oxide 
operations. 

Finally, following removal of mask 52, a gate oxide is grown over the entire 
array, creating gate oxides in the periphery only. 

Following the gate oxide growth step, a polysilicon layer^ which will create word 

lines for the memory array portion and will create gates for the periphery transistors, is laid 

down over the chip. If desired, a low resistive silicide, as is l<nown in the art, can be deposited 

over die polysilicon layer in order to reduce its resistivity. This creates a "'polysiHcide" layer. A 

typical total thickness of the polysilicide miglit be 0.3 - 0.4fj,m- As indicated by Fig. 6A, the 

polysilicide or polysilicon layer is then etched usmg a mask into word lines 60 within the 

memory array. Topically the word line etch also etches at least the top oxide 20 and the nitride 
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17 from between the word lines 60, This improves the charge retention of the memory cells by 
isolating the nitride layers 17 of each transistor. 

Pig. 6B illustrates one row of the resultant memory array in side view. The 
polysilicide or polysilicon layer 60 lies on top of the ONO structures 16 (Fig. 4), thereby 
forming the gates 22 (Fig. 1) of the NROM cells. Bit line oxides 50 are thick enough to isolate 
neighboring ONO structures 16. 

The memory chip is tlien finished in the standard ways, including a side wall 
oxidation step (typically a self-ahgned step), a Hghtly doped drain (Ldd) implant procedure into 
the CMOS periphery only and a spacer deposition Fig, 6A illustrates the location of tl^e spacers 
62 as being along the sidewalls of the polysilicon word lines 60. The Ldd typically requires 
separate masks for the n-chaimel and p-channel periphery transistors. 

It will be appreciated that, in the present invention, the tbicknesses of the vax'ious 
elements of the NROM cell are generally independent of each other. For example, the 
thicknesses of the bottom oxide, nitride and top oxide layers arc typically selected as a function 
of the desired operation of the meinox>' array, the bit line oxide is independent of the thickness 
of bottom oxide ONO structure and the gate oxide of the periphery is independent of the otiaer 
two oxide (i.e., the bit line oxide and the bottom ONO oxide) thicknesses. 

It will be appreciated by persons skilled in tiic art that the present invention is 
not limited to what has been particularly shown and described hereinabove. Ratlier the scope of 
tlie present hrvention is defined only by the claims which follow: 
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